Computer Architecture 
I/O Systems 


The Input/ Output Subsystem 


■ Input and output (I/O) devices allow us to communicate with the 
computer system. 

■ I/O is the transfer of data between primary memory and various I/O 
peripherals. 

■ Input devices: keyboards, mice, card readers, scanners, voice recognition 
systems, and touch screens (data in) 

■ Output devices: monitors, printers, plotters, and speakers (data out) 

■ These devices are not connected directly to the CPU. Instead, there is an 
interface that handles the data transfers. This interface converts the 
system bus signals to and from a format that is acceptable to the given 
device. 

■ The CPU communicates to these external devices via input/output 
registers 



This exchange of data is performed in two wavs : 

■ In memory-mapped I/O : the registers in the interface 
appear in the computer's memory map and there is no real 
difference between accessing memory and accessing an I/O 
device. Clearly, this is advantageous from the perspective of 
speed, but it uses up memory space in the system. 

■ Instruction-based I/O : the CPU has specialized 
instructions that perform the input and output. Although this 
does not use memory space, it requires specific I/O 
instructions, which implies it can be used only by CPUs that 
can execute these specific instructions. 

■ Interrupts play a very important part in I/O, because they 
are an efficient way to notify the CPU that input or output is 
available for us. 


Interrupts 

Important question: 

■ How these components interact with the processor??? 

■ Interrupts are events that alter (or interrupt) the 
normal flow of execution in the system. 

An interrupt can be triggered for a variety of reasons, 
including: 

■ I/O requests. 

■ Arithmetic errors (e.g., division by zero). 

■ Arithmetic overflow. 

■ Hardware malfunction (e.g., memory parity error). 


Interrupts (continued) 


■ User-defined break points (such as when debugging 
a program). 

■ Page faults. 

■ Invalid instructions (usually resulting from pointer 
issues). 

■ The actions performed for each of these types of 
interrupts (called interrupt handling ) are very 
different. 

■ Because telling the CPU that an I/O request has 
finished is much different from terminating a 
program because of division by zero. 
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Interrupt 


□ Can be initiated by the user or the system. 

□ Can be maskable (disabled or ignored) or 
nonmaskable {a high priority interrupt that cannot 
be disabled and must be acknowledged). 

□ Can occur within or between instructions. 

□ May be synchronous (occurs at the same place 
every time a program is executed) or 
asynchronous (occurs unexpectedly). 

□ Can result in the program terminating or 
continuing execution once the interrupt is 
handled. 


Bus organization and transfer 


■ The CPU communicates with the other components 
via a bus. 

■ A bus is a set of wires that acts as a shared but 
common data path to connect multiple subsystems 
within the system. 

■ It consists of multiple lines, allowing the parallel 
movement of bits. 

■ Buses are low cost but very versatile, and they make 
it easy to connect new devices to each other and to 
the system. 

■ At any one time, only one device (be it a register, the 
ALU, memory, or some other component) may use the 
bus. 

■ However, this sharing often results in a 
communication bottleneck. 

■ The speed of the bus is affected by its length as well 
as by the number of devices sharing it. 


(a) Point-to-Point Buses: connecting two specific components 


(b) A Multipoints Bus: common pathway that connects a number 
of devices 
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A typical bus consists: of data lines, address lines, control 
lines, and power lines . 


Power 




What does each line-set contain? 


1- Data bus. 

■ The lines of a bus is dedicated to moving data . 

■ These data lines contain the actual information that must be 
moved from one location to another. 


2- Control lines 

■ These lines indicate which device has permission to use the 
bus and for what purpose (reading or writing from memory or 
from an I/O device, for example). 

■ They also transfer acknowledgments for bus requests, 
interrupts, and clock synchronization signals. 

3- Address fines 

■ They indicate the location (in memory, for example) that the 
data should be either read from or written to. 


The power lines: 

■ They provide the electrical power necessary. 

■ T ypical bus transactions include: 

a) sending an address (for a read or write), 

b) transferring data from memory to a register (a 
memory read), 

c) transferring data to the memory from a register (a 
memory write). 

■ In addition, buses are used for I/O reads and 
writes from peripheral devices. 

■ Each type of transfer occurs within a bus cycle, the 
time between two ticks of the bus clock. 


Types of buses according to connected components 


■ Processor-memory buses are short, high-speed buses 
that are closely matched to the memory system on the 
machine to maximize the bandwidth (transfer of data) and 
are usually very design specific. 

■ I/O buses are typically longer than processor-memory 
buses and allow for many types of devices with varying 
bandwidths. These buses are compatible with much 
different architecture. 

■ A backplane bus (system bus) is actually built into the 
chassis of the machine and connects the processor, the I/O 
devices, and the memory (so all devices share one bus) 


High-performance systems often use all three types of buses. 






Common Bus system configuration: 


■ A more efficient scheme for transferring 
information between common bus registers in a 
multiple-register configuration is a common bus 
system. 

■ A bus structure consists of a set of common lines, 
one for each bit of a register, through which binary 
information is transferred one at a time. 

■ Control signals determine which register is selected 
by the bus during each particular register transfer. 

Constructing a common bus system 

a. Using multiplexers 

b. Using three state buffers 


Using multiplexers. 

■ One way of constructing a common bus system is 
with multiplexers. 

■ The multiplexers select the source register whose 
binary information is then placed on the bus. 
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Register A. 


■ In general, a bus system will multiplex k registers of n bits 
each to produce an n-line common bus. 

■ The number of multiplexers needed to construct the bus is 
equal to n. 

■ The size of each multiplexer must be k x 1 since it 
multiplexes k data lines. 

For example : 

■ A common bus for eight registers of 16 bits each requires 
16 multiplexers, one for each line in the bus. 

So: 

■ Each multiplexer must have eight data input lines and three 
selection lines to multiplex one significant bit in the eight 
registers. 


Using three state buffers. 


■ A bus system can be constructed with three-state 
gates instead of multiplexers. 

A three-state gate: 

■ Is a digital circuit that exhibits three states. Two of 
the states are signals equivalent to logic 1 and 0 as in 
a conventional gate. The third state is a high- 
impedance state. 

■ The high-impedance state behaves like an open circuit, 
which means that the output is disconnected and does 
not have logic significance. 
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( b) Truth table 


Bus line for bit 0 



■ When the control input of the buffer is equal to 1, 
the output is enabled and the gate behaves like any 
conventional buffer, with the output equal to the 
normal input. 

■ When the control input is 0, the output is disabled 
and the gate goes to a high-impedance state (Hi-Z), 
regardless of the value in the normal input. 


Instruction Set Architecture (ISA) 
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Instructions 


• Instructions are the “words” of a computer 


• Instruction set architecture (ISA) is its 
vocabulary 

• This defines most of the interface to the 
processor. 


Instruction Set Design 



■ Multiple Implementations: 8086 -> Pentium 4 

■ ISAs evolve: MIPS-I, MIPS-II, MIPS-II, MIPS-IV, 
MIPS,MDMX, MIPS-32, MIPS-64 


Basics 


■ C statement 

f = (9 + h) - (i + j) 

■ MIPS instructions 
add to, g, h 

add tl, i, j 
sub f, to, tl 

■ Opcode/mnemonic 

■ Operands, source/destination 


Basics 


■ Opcode: specifies the kind of operation (mnemonic) 

■ Operands: input and output data (source/destination) 

■ Operands to & tl are temporaries 

■ One operation, two inputs, one output 

■ Multiple instructions for one C statement 


P rocessor Execution Cycle 
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Obtain instruction from program storage 

Determine required actions and instruction size 

Locate and obtain operand data 

Compute result value or status 

Deposit results in register or storage for later use 

Determine successor instruction 


Addressing Modes 


Addressing Modes 


■ Addressing modes are an aspect of the instruction set 
architecture in most central processing unit (CPU) designs. 


The various addressing modes that are defined in a given 
instruction set architecture define how machine 
language instructions in that architecture identify 
the operand (or operands) of each instruction. 

■ An addressing mode specifies how to calculate the 
effective memory address of an operand by using information 
held in registers and/or constants contained within a machine 
instruction or elsewhere. 


Addressing Modes 


■ Immediate 

■ Direct 

■ Indirect 

■ Register 

■ Register Indirect 

■ Displacement (Indexed) 

■ Stack 


Immediate Addressing 

■ Operand is a part of instruction 

■ Operand = address field 

■ e.g. ADD 5 

□ Add 5 to contents of accumulator 

□ 5 is operand 

■ No memory reference to fetch data 

■ Fast 

■ Limited range 


Immediate Addressing Diagram 
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Direct Addressing 


■ Address field contains address of operand 

■ Effective address (EA) = address field (A) 

■ e.g. ADD A 

□ Add contents of cell A to accumulator 

□ Look in memory at address A for operand 

■ Single memory reference to access data 

■ No additional calculations to work out effective address 

■ Limited address space 


Direct Addressing Diagram 


Instruction 



Indirect Addressing 


■ Memory cell pointed to by address field contains the 
address of (pointer to) the operand. 

■ EA = (A) 

□ Look in A, find address (A) and look there for operand 

■ e.g. ADD (A) 

□ Add contents of cell pointed to by contents of A to 
accumulator 


Indirect Addressing Diagram 


Instruction 



Indirect Addressing 


■ Large address space: 2 n where n = word length 

■ May be nested, multilevel, cascaded 
□ e.g. EA = (((A))) 

■ Multiple memory accesses to find operand 


Hence slower 


Register Addressing 

■ Operand is held in register named in address filed 

■ EA = R 

■ Limited number of registers 

■ Very small address field needed 

□ Shorter instructions 

□ Faster instruction fetch 


Register Addressing Diagram 


Instruction 



Register Addressing 


■ No memory access 

■ Very fast execution 

■ Very limited address space 

■ Multiple registers helps performance 

□ Requires good assembly programming or compiler writing 


Register Indirect Addressing 


■ like indirect addressing 

■ EA = (R) 

■ Operand is in memory cell pointed to by contents of 
register R 

■ Large address space (2 n ) 


One fewer memory access than indirect addressing 


Register Indirect Addressing Diagram 


Instruction 



Displacement Addressing 

- EA = A + (R) 


■ Address field hold two values 

□ A = base value 

□ R = register that holds displacement 


Displacement Addressing Diagram 
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Relative Addressing 

■ A version of displacement addressing 

■ R = Program counter, PC 

■ EA = A + (PC) 

■ get operand from A cells from current location 
pointed to by PC 


Indexed Addressing 

■ R1 = base 

■ R2 = displacement 

■ EA = R1 + R2 

■ Good for accessing arrays 

□ EA = R1 + R2 

□ R2++ 


Stack Addressing 


Stack: First-In Last-Out data structure (FILO) 
Instruction operands 

- None for ALU operations 

- One for push/pop 

Advantages: 

- Short instructions 

- Compiler is easy to write 

Disadvantages 

- Code is inefficient 

• Fix: random access to stacked values 

- Stack size & access latency 

• Fix : register file or cache for top entries 

Examples 

- 60s: Burroughs B5500/6500, HP 3000/70 

- Today: Java VM 




Accumulator Addressing 


Single register (accumulator) 
Instructions 


I ACC U-^ 


- ALU (Acc <— Acc + *M) 

- Load to accumulator (Acc <— *M) 

1 


' 

- Store from accumulator (*M <— Acc) 

\ALU/ 

• Instruction operands 



- One explicit (memory address) 



- One implicit (accumulator) 


f 


• Attributes: 

- Short instructions 

- Minimal internal state; simple design 

- Many loads and stores 

• Examples: 

- Early machines: IBM 7090, DEC PDP-8 

- Today: DSP architectures 



A=B+ (C*D) 
load C 
mul D 
add B 
store A 
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Register-Set Architectures 


• General Purpose Registers (GPRs) 

• Registers: 

- Explicitly managed memory for holding 
recently used values 

• The dominant architecture: CDC 6600, 
IBM 360/370, PDP-1 1 , 68000, RISC 

• Advantages: 

- Allows fast access to temporary values 

- Permits clever compiler optimization 

- Reduced traffic to memory 

• Disadvantages: 

- Longer instructions (than accumulator 
designs) 
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Instruction Format 




Register-to-Register: 

• No memory addresses in ALU ops 

-j 

• Typically 3-operand ALU ops 

- Bigger encoding, but simplifies 
register allocation 

• Advantages 

is 

- Simple fixed-length instructions 

- Easily pipelined 

• Disadvantages 

- Higher instruction count 

• Examples 

- CDC6600, CRAY- 1 , most RISCs 


Load-Store 




Register-to-Memory Architectures 






One memory address in ALU ops 
Typically 2-operand ALU ops 
Advantages 

- Small instruction count 

- Dense encoding 

Disadvantages 

- Result destroys an operand 

- Instruction length varies 

- Clocks per instruction varies 

- Harder to pipeline 

Examples 

- IBM 360/370, VAX 



Memory- to-Memory Architectures 




All ALU operands from memory addresses 

Advantages 

- No register wastage 

- Lowest instruction count 

Disadvantages 

- Large variation in instruction length 

- Large variation in clocks per instructions 

- Huge memory traffic 

Examples 

- VAX 



D = 

B + ( C*D ) 

mul 

D 

<- CAD 

add 

D 

<- D+B 


Thanks for your attention 
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